Update README.md
Browse files
README.md
CHANGED
|
@@ -1,469 +1,42 @@
|
|
| 1 |
---
|
|
|
|
| 2 |
tags:
|
| 3 |
- sentence-transformers
|
| 4 |
- sentence-similarity
|
|
|
|
|
|
|
| 5 |
- feature-extraction
|
| 6 |
-
-
|
| 7 |
-
-
|
| 8 |
-
- loss:CosineSimilarityLoss
|
| 9 |
-
widget:
|
| 10 |
-
- source_sentence: 'Central Bankers, Bureaucratic Incentives, and Monetary Policy:
|
| 11 |
-
an Introduction'
|
| 12 |
-
sentences:
|
| 13 |
-
- The idea that institutional structures, or property rights, affect individual
|
| 14 |
-
behavior is not a new one. In fact, this idea has been the focus of much of the
|
| 15 |
-
public choice/property rights literature of the past two decades. What is relatively
|
| 16 |
-
new is the application of this principle to central bank behavior. Until recently,
|
| 17 |
-
economists have tended to analyze central banks in a Utopian framework. The implicit
|
| 18 |
-
assumption has been that central banks conduct their operations to maximize the
|
| 19 |
-
public interest. Any deviation from the optimal rate of inflation could be attributed
|
| 20 |
-
to technical problems associated with unanticipated changes in the money multiplier
|
| 21 |
-
or money demand. Because of the assumptions employed, the economist’s role is
|
| 22 |
-
that of a technician, one who identifies the precise relationships between the
|
| 23 |
-
monetary base, the money supply, and the resulting inflation rate.
|
| 24 |
-
- This paper examines empirical evidence of the adaptive learning behavior of speculators
|
| 25 |
-
in the 2005–2008 renminbi appreciation episode, and establishes a theoretical
|
| 26 |
-
model to explore appreciation policy implications of such a behavior. In our model,
|
| 27 |
-
speculators form their expectations about the future appreciation premium adaptively
|
| 28 |
-
by extrapolating past appreciation returns into the future. We find that a rapid
|
| 29 |
-
appreciation may attract more capital inflows, and the central bank may prefer
|
| 30 |
-
a slow appreciation to discourage capital inflows. Simulated results can generate
|
| 31 |
-
hump-shaped paths of the appreciation speed, expected appreciation premium, and
|
| 32 |
-
capital inflows. In addition, changes in the appreciation speed precede changes
|
| 33 |
-
in the expected appreciation premium and capital inflows. These results are consistent
|
| 34 |
-
with empirical evidence in the 2005–2008 renminbi appreciation episode.
|
| 35 |
-
- This paper makes stochastic projections of the Central Bank of Chile’s (CBCh)
|
| 36 |
-
balance sheet (stocks and flows) starting from the actual current negative net
|
| 37 |
-
worth. These projections incorporate the effect on the balance sheet of several
|
| 38 |
-
macroeconomic variables as well as alternative policy decisions, taking into account
|
| 39 |
-
the uncertainty and risks inherent in the economy. The article describes and assesses
|
| 40 |
-
the main causes of the present deficit. In the baseline scenario, the deterministic
|
| 41 |
-
forecast shows that the Central Bank's net worth will increase slowly to reach
|
| 42 |
-
positive values after 25 years. However, in the stochastic simulations the Bank's
|
| 43 |
-
net worth will still be negative 25 years from now, with a 69 percent probability.
|
| 44 |
-
- source_sentence: New Forms of the Management to Loan Size and the Consequences--A
|
| 45 |
-
Factor Analysis of Money Expanding from 1996 to the Second Quarter of 2004
|
| 46 |
-
sentences:
|
| 47 |
-
- When the nominal return on all public liabilities is allowed to adjust to changing
|
| 48 |
-
market conditions, or the central bank has access to unlimited open market operations,
|
| 49 |
-
money growth is likely to stimulate output. This is shown in the model used by
|
| 50 |
-
Lucas in his Nobel Prize Lecture as an example of the non neutral effects of anticipated
|
| 51 |
-
monetary expansions. A rise in net outside assets increases households' incentives
|
| 52 |
-
to work through a reallocation of consumption across periods. This result survives
|
| 53 |
-
with non interest-bearing cash when the latter does not generate relevant distortions.
|
| 54 |
-
- central bank is insolvent if its plans imply a Ponzi scheme on reserves so the
|
| 55 |
-
price level becomes infinity. If the central bank enjoys fiscal support, in the
|
| 56 |
-
form of a dividend rule that pays out net income every period, including when
|
| 57 |
-
it is negative, it can never become insolvent independently of the fiscal authority.
|
| 58 |
-
Otherwise, this note distinguishes between intertemporal insolvency, rule insolvency,
|
| 59 |
-
and period insolvency. While period and rule solvency depend on analyzing dividend
|
| 60 |
-
rules and sources of risk to net income, evaluating intertemporal solvency requires
|
| 61 |
-
overcoming the difficult challenge of measuring the present value of seignorage.
|
| 62 |
-
- This study argues that financializations is not a phenomenon exclusively associated
|
| 63 |
-
with complex innovation in highly developed financial markets. Financialization
|
| 64 |
-
also affects countries with 'shallow' financial markets but with a significant
|
| 65 |
-
presence of transnational financial actors that become a powerful economic and
|
| 66 |
-
political force able to navigate and shape uneven regulatory and institutional
|
| 67 |
-
terrains in order to sustain new modes of profit generation. The study distinguishes
|
| 68 |
-
two stages in financialization of the Romanian economy. The first, central-bank
|
| 69 |
-
dominated stage saw the systemic transformation of firm-bank-state relations that
|
| 70 |
-
resulted in the creation of impatient banking and the forestalling of industrial
|
| 71 |
-
policy options. The second, dependent financialization, is in turn characterised
|
| 72 |
-
by new modes of profit generation for transnational financial actors, interconnectedness
|
| 73 |
-
and fragility as the main mechanism of incorporation in European financial structures.
|
| 74 |
-
- source_sentence: Central Bank Participation in Currency Options Markets
|
| 75 |
-
sentences:
|
| 76 |
-
- Abstract This paper investigates the impact of the central bank's liquidity operations
|
| 77 |
-
on the financial constraints of the bank-dependent firms. We use the Reserve Bank
|
| 78 |
-
of India's liquidity operation called Term Repo Operation (TRO) in the study.
|
| 79 |
-
The empirical analysis is based on a large scale firm-level data for the period
|
| 80 |
-
2011–2016 and panel logit estimation method. Our findings indicate that the financial
|
| 81 |
-
constraints of the bank-dependent firms have reduced than their counterparts since
|
| 82 |
-
the introduction of the operation. We also show that larger firms reap significant
|
| 83 |
-
benefit out of TRO than the smaller firms.
|
| 84 |
-
- Since 1990, Poland has adopted nearly all possible exchange rate regimes, moving
|
| 85 |
-
smoothly from fixed peg to pure floating. At the beginning of the transition period
|
| 86 |
-
exchange rate policy was aimed mainly at stabilising the economy. It supported
|
| 87 |
-
monetary policy in its fight against inflation, playing the role of a nominal
|
| 88 |
-
anchor. Under the stabilisation plan, the Polish authorities decided to fix the
|
| 89 |
-
zloty to the US dollar at the level of 9,500 zlotys to one US$ in January 1990.
|
| 90 |
-
In that way the government and the central bank delivered a strong nominal anchor
|
| 91 |
-
for an economy facing dramatic and abrupt adjustments in the real sector and in
|
| 92 |
-
relative prices. At the same time, the zloty became an internally convertible
|
| 93 |
-
currency.
|
| 94 |
-
- This paper models unemployment as a general equilibrium solution in labor and
|
| 95 |
-
capital markets, while the natural rate hypothesis explains unemployment simply
|
| 96 |
-
as a partial equilibrium in the labor market. It is shown that monetary policy
|
| 97 |
-
can have long-run effects by affecting required returns on capital and investment.
|
| 98 |
-
If monetary policy is primarily concerned with maintaining price stability, the
|
| 99 |
-
interaction between wage bargaining and the central bank’s credibility as an inflation
|
| 100 |
-
fighter becomes a crucial factor in determining employment. Different labor market
|
| 101 |
-
institutions condition different monetary policy reactions. With centralized wage
|
| 102 |
-
bargaining, a central bank mandate focusing primarily on price stability is sufficient.
|
| 103 |
-
With an atomistic labor market, the central bank must also consider output as
|
| 104 |
-
a policy objective.
|
| 105 |
-
- source_sentence: 'The politics of the European Central Bank: principal-agent theory
|
| 106 |
-
and the democratic deficit'
|
| 107 |
-
sentences:
|
| 108 |
-
- ABSTRACTThe stress test of the European Central Bank has become one of the primary
|
| 109 |
-
regulatory tools for the European banking system. In order to make such a regulatory
|
| 110 |
-
indicator, different national...
|
| 111 |
-
- This paper uses the VAR methodology to analyse the effects of European Central
|
| 112 |
-
Bank monetary policy shocks and euro area output and inflation shocks on the European
|
| 113 |
-
Union member states from Central and Eastern Europe. First, we look at the strength
|
| 114 |
-
of effects of identified euro area monetary policy shocks and compare the influence
|
| 115 |
-
with the one of the domestic policy shocks. Next, we turn to analysis of output
|
| 116 |
-
and inflation responses to euro area output and inflation shocks relative to aggregate
|
| 117 |
-
euro area reaction. We provide implications for each country monetary policy decision
|
| 118 |
-
process and draw conclusions concerning readiness for euro adoption, both from
|
| 119 |
-
monetary and real economy point of view.
|
| 120 |
-
- The "conservative central banker" has come under attack recently. On the basis
|
| 121 |
-
of models in which there is explicit interaction between trade union behavior
|
| 122 |
-
and monetary policy, it has been argued that if 'trade unions' are averse to inflation,
|
| 123 |
-
welfare will be lower with a conservative than with a liberal central bank. We
|
| 124 |
-
reframe this discussion in a standard trade union model. We show that the case
|
| 125 |
-
against the conservative central banker rests exclusively on the assumption of
|
| 126 |
-
a strictly nominal outside option (for instance, unemployment benefits) for the
|
| 127 |
-
union. There is no welfare gain associated with making the central bank less conservative
|
| 128 |
-
than society, however, if the outside option is in real terms. As the nominal
|
| 129 |
-
components of the trade union's outside option are mainly public transfers, we
|
| 130 |
-
also show that the conservative central banker is always optimal if the government
|
| 131 |
-
can choose the level of nominal unemployment benefits as well as the degree of
|
| 132 |
-
central bank conservatism.
|
| 133 |
-
- source_sentence: Global Liquidity and Monetary Policy Autonomy
|
| 134 |
-
sentences:
|
| 135 |
-
- In the 1970s monetarism was accepted as a principle of policy of central banks.
|
| 136 |
-
However monetary growth targets currently play no official role in the monetary
|
| 137 |
-
policy of industrial countries. The majority of monetary authorities excluded
|
| 138 |
-
money from their analysis. Inflation, GDP and unemployment are not determined
|
| 139 |
-
by money supply in their models. This article provides a historical perspective
|
| 140 |
-
on the development and apparent failure of monetarism as a policy guide.
|
| 141 |
-
- Deep financial crisis which started in 2007 proved to be extremely contagious
|
| 142 |
-
affecting the financial and banking EU system. Achieving an integrated banking
|
| 143 |
-
market is the main component of the European policy in the financial-banking services
|
| 144 |
-
area. The latest developments underlined that the difficulties faced by the banks
|
| 145 |
-
can negatively impact on the entire financial stability of the member states.
|
| 146 |
-
That's why, the European Central Bank will be entitled to supervise any bank of
|
| 147 |
-
the euro area, especially the ones that benefit of public support. Reforming the
|
| 148 |
-
financial and banking system must be shaped in the frame of insuring some durable
|
| 149 |
-
national finances, of an urgent recapitalizing of the banks that need that and
|
| 150 |
-
of elaborating some common fiscal and financial and banking regulations effective
|
| 151 |
-
in the eurozone
|
| 152 |
-
- We have confirmed that the cause of Japan’s recession has been the sharp reduction
|
| 153 |
-
in credit creation that began in 1992 and was triggered by the bad debts in the
|
| 154 |
-
banking system. We have also found that this was due to excessive loan growth
|
| 155 |
-
quotas imposed on the banks by the Bank of Japan during the 1980s. Finally, we
|
| 156 |
-
found that the problem of lack of credit during the 1990s could easily have been
|
| 157 |
-
solved through monetary policy. Bad debts could have been taken off the banks’
|
| 158 |
-
balance sheets without costs by the central bank. Even without bank lending, the
|
| 159 |
-
central bank could have created a recovery a decade ago, by significantly increasing
|
| 160 |
-
its own credit creation. In other words, Japan’s recession of the 1990s has been
|
| 161 |
-
the result of the Bank of Japan’s policies.2
|
| 162 |
-
pipeline_tag: sentence-similarity
|
| 163 |
library_name: sentence-transformers
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 164 |
---
|
| 165 |
|
| 166 |
-
#
|
| 167 |
-
|
| 168 |
-
This is a [sentence-transformers](https://www.SBERT.net) model trained. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
|
| 169 |
-
|
| 170 |
-
## Model Details
|
| 171 |
-
|
| 172 |
-
### Model Description
|
| 173 |
-
- **Model Type:** Sentence Transformer
|
| 174 |
-
<!-- - **Base model:** [Unknown](https://huggingface.co/unknown) -->
|
| 175 |
-
- **Maximum Sequence Length:** 300 tokens
|
| 176 |
-
- **Output Dimensionality:** 1024 dimensions
|
| 177 |
-
- **Similarity Function:** Cosine Similarity
|
| 178 |
-
<!-- - **Training Dataset:** Unknown -->
|
| 179 |
-
<!-- - **Language:** Unknown -->
|
| 180 |
-
<!-- - **License:** Unknown -->
|
| 181 |
-
|
| 182 |
-
### Model Sources
|
| 183 |
-
|
| 184 |
-
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
|
| 185 |
-
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
|
| 186 |
-
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
|
| 187 |
-
|
| 188 |
-
### Full Model Architecture
|
| 189 |
-
|
| 190 |
-
```
|
| 191 |
-
SentenceTransformer(
|
| 192 |
-
(0): Transformer({'max_seq_length': 300, 'do_lower_case': True}) with Transformer model: BertModel
|
| 193 |
-
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
|
| 194 |
-
(2): Normalize()
|
| 195 |
-
)
|
| 196 |
-
```
|
| 197 |
-
|
| 198 |
-
## Usage
|
| 199 |
-
|
| 200 |
-
### Direct Usage (Sentence Transformers)
|
| 201 |
-
|
| 202 |
-
First install the Sentence Transformers library:
|
| 203 |
-
|
| 204 |
-
```bash
|
| 205 |
-
pip install -U sentence-transformers
|
| 206 |
-
```
|
| 207 |
-
|
| 208 |
-
Then you can load this model and run inference.
|
| 209 |
-
```python
|
| 210 |
-
from sentence_transformers import SentenceTransformer
|
| 211 |
-
|
| 212 |
-
# Download from the 🤗 Hub
|
| 213 |
-
model = SentenceTransformer("sentence_transformers_model_id")
|
| 214 |
-
# Run inference
|
| 215 |
-
sentences = [
|
| 216 |
-
'Global Liquidity and Monetary Policy Autonomy',
|
| 217 |
-
'In the 1970s monetarism was accepted as a principle of policy of central banks. However monetary growth targets currently play no official role in the monetary policy of industrial countries. The majority of monetary authorities excluded money from their analysis. Inflation, GDP and unemployment are not determined by money supply in their models. This article provides a historical perspective on the development and apparent failure of monetarism as a policy guide.',
|
| 218 |
-
"Deep financial crisis which started in 2007 proved to be extremely contagious affecting the financial and banking EU system. Achieving an integrated banking market is the main component of the European policy in the financial-banking services area. The latest developments underlined that the difficulties faced by the banks can negatively impact on the entire financial stability of the member states. That's why, the European Central Bank will be entitled to supervise any bank of the euro area, especially the ones that benefit of public support. Reforming the financial and banking system must be shaped in the frame of insuring some durable national finances, of an urgent recapitalizing of the banks that need that and of elaborating some common fiscal and financial and banking regulations effective in the eurozone",
|
| 219 |
-
]
|
| 220 |
-
embeddings = model.encode(sentences)
|
| 221 |
-
print(embeddings.shape)
|
| 222 |
-
# [3, 1024]
|
| 223 |
-
|
| 224 |
-
# Get the similarity scores for the embeddings
|
| 225 |
-
similarities = model.similarity(embeddings, embeddings)
|
| 226 |
-
print(similarities.shape)
|
| 227 |
-
# [3, 3]
|
| 228 |
-
```
|
| 229 |
-
|
| 230 |
-
<!--
|
| 231 |
-
### Direct Usage (Transformers)
|
| 232 |
-
|
| 233 |
-
<details><summary>Click to see the direct usage in Transformers</summary>
|
| 234 |
-
|
| 235 |
-
</details>
|
| 236 |
-
-->
|
| 237 |
-
|
| 238 |
-
<!--
|
| 239 |
-
### Downstream Usage (Sentence Transformers)
|
| 240 |
-
|
| 241 |
-
You can finetune this model on your own dataset.
|
| 242 |
-
|
| 243 |
-
<details><summary>Click to expand</summary>
|
| 244 |
-
|
| 245 |
-
</details>
|
| 246 |
-
-->
|
| 247 |
-
|
| 248 |
-
<!--
|
| 249 |
-
### Out-of-Scope Use
|
| 250 |
-
|
| 251 |
-
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
| 252 |
-
-->
|
| 253 |
-
|
| 254 |
-
<!--
|
| 255 |
-
## Bias, Risks and Limitations
|
| 256 |
-
|
| 257 |
-
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
|
| 258 |
-
-->
|
| 259 |
-
|
| 260 |
-
<!--
|
| 261 |
-
### Recommendations
|
| 262 |
-
|
| 263 |
-
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
| 264 |
-
-->
|
| 265 |
-
|
| 266 |
-
## Training Details
|
| 267 |
-
|
| 268 |
-
### Training Dataset
|
| 269 |
-
|
| 270 |
-
#### Unnamed Dataset
|
| 271 |
-
|
| 272 |
-
* Size: 29,026 training samples
|
| 273 |
-
* Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>label</code>
|
| 274 |
-
* Approximate statistics based on the first 1000 samples:
|
| 275 |
-
| | sentence_0 | sentence_1 | label |
|
| 276 |
-
|:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:---------------------------------------------------------------|
|
| 277 |
-
| type | string | string | float |
|
| 278 |
-
| details | <ul><li>min: 6 tokens</li><li>mean: 15.09 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 134.34 tokens</li><li>max: 288 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.48</li><li>max: 1.0</li></ul> |
|
| 279 |
-
* Samples:
|
| 280 |
-
| sentence_0 | sentence_1 | label |
|
| 281 |
-
|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
|
| 282 |
-
| <code>Half-Life Estimation under the Taylor Rule ∗</code> | <code>Abstract We aim to investigate the simultaneous and interacted effects of central bank qualitative and quantitative communication on private inflation expectations, measured with survey and market-based measures. The effects of ECB inflation projections and Governing Council members’ speeches are identified through an instrumental-variables estimation using a principal component analysis to generate relevant instruments. We find that ECB projections have a positive effect on current-year forecasts, and that ECB projections and speeches are substitutes at longer horizons. Moreover, ECB speeches and the ECB rate reinforce the effect of ECB projections when they are consistent, and convey the same signal about inflationary pressures.</code> | <code>0.0</code> |
|
| 283 |
-
| <code>Lasting Legacy on Financial Regulatory reform</code> | <code>At their meeting on October 23, the G20 finance ministers and central bank governors prioritized a number of regulatory issues for discussion at the leader’s summit. The familiar items included a ritualistic commitment to implement all reforms endorsed already by the G20 in an internationally consistent and non-discriminatory manner, such as those relating to over-the-counter derivatives, compensation practices, accounting standards and credit rating agencies.</code> | <code>1.0</code> |
|
| 284 |
-
| <code>Tři roky po měnové krizi: rekapitulace událostí a jejich souvislostí [Three years after the exchange rate crisis: recapitulation of events and their connections]</code> | <code>We examine the European Central Bank's ad-hoc communication and explore how it informs future monetary policy decisions. Using the rich dataset of the inter-meeting verbal communication among the members of the European Central Bank's Governing Council between 2008 and 2014, we construct a measure of communication assessing its inclination towards easing, tightening or maintaining the monetary policy stance. We find that this measure provides useful additional information about future monetary policy decisions, even when we control for market-based interest rate expectations and lagged decisions. Our results also suggest that, in particular, communication shortly before monetary policy meetings, related to unconventional measures and/or by the ECB President explain the future ECB rate changes well. Overall, these results point to the importance of transparency in understanding the future course of monetary policy.</code> | <code>0.0</code> |
|
| 285 |
-
* Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
|
| 286 |
-
```json
|
| 287 |
-
{
|
| 288 |
-
"loss_fct": "torch.nn.modules.loss.MSELoss"
|
| 289 |
-
}
|
| 290 |
-
```
|
| 291 |
-
|
| 292 |
-
### Training Hyperparameters
|
| 293 |
-
#### Non-Default Hyperparameters
|
| 294 |
-
|
| 295 |
-
- `per_device_train_batch_size`: 16
|
| 296 |
-
- `per_device_eval_batch_size`: 16
|
| 297 |
-
- `num_train_epochs`: 1
|
| 298 |
-
- `multi_dataset_batch_sampler`: round_robin
|
| 299 |
-
|
| 300 |
-
#### All Hyperparameters
|
| 301 |
-
<details><summary>Click to expand</summary>
|
| 302 |
-
|
| 303 |
-
- `overwrite_output_dir`: False
|
| 304 |
-
- `do_predict`: False
|
| 305 |
-
- `eval_strategy`: no
|
| 306 |
-
- `prediction_loss_only`: True
|
| 307 |
-
- `per_device_train_batch_size`: 16
|
| 308 |
-
- `per_device_eval_batch_size`: 16
|
| 309 |
-
- `per_gpu_train_batch_size`: None
|
| 310 |
-
- `per_gpu_eval_batch_size`: None
|
| 311 |
-
- `gradient_accumulation_steps`: 1
|
| 312 |
-
- `eval_accumulation_steps`: None
|
| 313 |
-
- `torch_empty_cache_steps`: None
|
| 314 |
-
- `learning_rate`: 5e-05
|
| 315 |
-
- `weight_decay`: 0.0
|
| 316 |
-
- `adam_beta1`: 0.9
|
| 317 |
-
- `adam_beta2`: 0.999
|
| 318 |
-
- `adam_epsilon`: 1e-08
|
| 319 |
-
- `max_grad_norm`: 1
|
| 320 |
-
- `num_train_epochs`: 1
|
| 321 |
-
- `max_steps`: -1
|
| 322 |
-
- `lr_scheduler_type`: linear
|
| 323 |
-
- `lr_scheduler_kwargs`: {}
|
| 324 |
-
- `warmup_ratio`: 0.0
|
| 325 |
-
- `warmup_steps`: 0
|
| 326 |
-
- `log_level`: passive
|
| 327 |
-
- `log_level_replica`: warning
|
| 328 |
-
- `log_on_each_node`: True
|
| 329 |
-
- `logging_nan_inf_filter`: True
|
| 330 |
-
- `save_safetensors`: True
|
| 331 |
-
- `save_on_each_node`: False
|
| 332 |
-
- `save_only_model`: False
|
| 333 |
-
- `restore_callback_states_from_checkpoint`: False
|
| 334 |
-
- `no_cuda`: False
|
| 335 |
-
- `use_cpu`: False
|
| 336 |
-
- `use_mps_device`: False
|
| 337 |
-
- `seed`: 42
|
| 338 |
-
- `data_seed`: None
|
| 339 |
-
- `jit_mode_eval`: False
|
| 340 |
-
- `use_ipex`: False
|
| 341 |
-
- `bf16`: False
|
| 342 |
-
- `fp16`: False
|
| 343 |
-
- `fp16_opt_level`: O1
|
| 344 |
-
- `half_precision_backend`: auto
|
| 345 |
-
- `bf16_full_eval`: False
|
| 346 |
-
- `fp16_full_eval`: False
|
| 347 |
-
- `tf32`: None
|
| 348 |
-
- `local_rank`: 0
|
| 349 |
-
- `ddp_backend`: None
|
| 350 |
-
- `tpu_num_cores`: None
|
| 351 |
-
- `tpu_metrics_debug`: False
|
| 352 |
-
- `debug`: []
|
| 353 |
-
- `dataloader_drop_last`: False
|
| 354 |
-
- `dataloader_num_workers`: 0
|
| 355 |
-
- `dataloader_prefetch_factor`: None
|
| 356 |
-
- `past_index`: -1
|
| 357 |
-
- `disable_tqdm`: False
|
| 358 |
-
- `remove_unused_columns`: True
|
| 359 |
-
- `label_names`: None
|
| 360 |
-
- `load_best_model_at_end`: False
|
| 361 |
-
- `ignore_data_skip`: False
|
| 362 |
-
- `fsdp`: []
|
| 363 |
-
- `fsdp_min_num_params`: 0
|
| 364 |
-
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
|
| 365 |
-
- `fsdp_transformer_layer_cls_to_wrap`: None
|
| 366 |
-
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
|
| 367 |
-
- `deepspeed`: None
|
| 368 |
-
- `label_smoothing_factor`: 0.0
|
| 369 |
-
- `optim`: adamw_torch
|
| 370 |
-
- `optim_args`: None
|
| 371 |
-
- `adafactor`: False
|
| 372 |
-
- `group_by_length`: False
|
| 373 |
-
- `length_column_name`: length
|
| 374 |
-
- `ddp_find_unused_parameters`: None
|
| 375 |
-
- `ddp_bucket_cap_mb`: None
|
| 376 |
-
- `ddp_broadcast_buffers`: False
|
| 377 |
-
- `dataloader_pin_memory`: True
|
| 378 |
-
- `dataloader_persistent_workers`: False
|
| 379 |
-
- `skip_memory_metrics`: True
|
| 380 |
-
- `use_legacy_prediction_loop`: False
|
| 381 |
-
- `push_to_hub`: False
|
| 382 |
-
- `resume_from_checkpoint`: None
|
| 383 |
-
- `hub_model_id`: None
|
| 384 |
-
- `hub_strategy`: every_save
|
| 385 |
-
- `hub_private_repo`: None
|
| 386 |
-
- `hub_always_push`: False
|
| 387 |
-
- `gradient_checkpointing`: False
|
| 388 |
-
- `gradient_checkpointing_kwargs`: None
|
| 389 |
-
- `include_inputs_for_metrics`: False
|
| 390 |
-
- `include_for_metrics`: []
|
| 391 |
-
- `eval_do_concat_batches`: True
|
| 392 |
-
- `fp16_backend`: auto
|
| 393 |
-
- `push_to_hub_model_id`: None
|
| 394 |
-
- `push_to_hub_organization`: None
|
| 395 |
-
- `mp_parameters`:
|
| 396 |
-
- `auto_find_batch_size`: False
|
| 397 |
-
- `full_determinism`: False
|
| 398 |
-
- `torchdynamo`: None
|
| 399 |
-
- `ray_scope`: last
|
| 400 |
-
- `ddp_timeout`: 1800
|
| 401 |
-
- `torch_compile`: False
|
| 402 |
-
- `torch_compile_backend`: None
|
| 403 |
-
- `torch_compile_mode`: None
|
| 404 |
-
- `include_tokens_per_second`: False
|
| 405 |
-
- `include_num_input_tokens_seen`: False
|
| 406 |
-
- `neftune_noise_alpha`: None
|
| 407 |
-
- `optim_target_modules`: None
|
| 408 |
-
- `batch_eval_metrics`: False
|
| 409 |
-
- `eval_on_start`: False
|
| 410 |
-
- `use_liger_kernel`: False
|
| 411 |
-
- `eval_use_gather_object`: False
|
| 412 |
-
- `average_tokens_across_devices`: False
|
| 413 |
-
- `prompts`: None
|
| 414 |
-
- `batch_sampler`: batch_sampler
|
| 415 |
-
- `multi_dataset_batch_sampler`: round_robin
|
| 416 |
-
|
| 417 |
-
</details>
|
| 418 |
-
|
| 419 |
-
### Training Logs
|
| 420 |
-
| Epoch | Step | Training Loss |
|
| 421 |
-
|:------:|:----:|:-------------:|
|
| 422 |
-
| 0.2755 | 500 | 0.0844 |
|
| 423 |
-
| 0.5510 | 1000 | 0.0667 |
|
| 424 |
-
| 0.8264 | 1500 | 0.0607 |
|
| 425 |
|
|
|
|
| 426 |
|
| 427 |
-
##
|
| 428 |
-
- Python: 3.11.13
|
| 429 |
-
- Sentence Transformers: 4.1.0
|
| 430 |
-
- Transformers: 4.52.4
|
| 431 |
-
- PyTorch: 2.7.1+cu126
|
| 432 |
-
- Accelerate: 1.7.0
|
| 433 |
-
- Datasets: 2.14.4
|
| 434 |
-
- Tokenizers: 0.21.1
|
| 435 |
|
| 436 |
-
##
|
| 437 |
|
| 438 |
-
|
| 439 |
|
| 440 |
-
###
|
| 441 |
-
```bibtex
|
| 442 |
-
@inproceedings{reimers-2019-sentence-bert,
|
| 443 |
-
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
| 444 |
-
author = "Reimers, Nils and Gurevych, Iryna",
|
| 445 |
-
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
| 446 |
-
month = "11",
|
| 447 |
-
year = "2019",
|
| 448 |
-
publisher = "Association for Computational Linguistics",
|
| 449 |
-
url = "https://arxiv.org/abs/1908.10084",
|
| 450 |
-
}
|
| 451 |
-
```
|
| 452 |
|
| 453 |
-
|
| 454 |
-
## Glossary
|
| 455 |
|
| 456 |
-
|
| 457 |
-
-->
|
| 458 |
|
| 459 |
-
|
| 460 |
-
## Model Card Authors
|
| 461 |
|
| 462 |
-
|
| 463 |
-
-->
|
| 464 |
|
| 465 |
-
|
| 466 |
-
## Model Card Contact
|
| 467 |
|
| 468 |
-
|
| 469 |
-
-->
|
|
|
|
| 1 |
---
|
| 2 |
+
license: mit
|
| 3 |
tags:
|
| 4 |
- sentence-transformers
|
| 5 |
- sentence-similarity
|
| 6 |
+
- central-bank
|
| 7 |
+
- semantic-textual-similarity
|
| 8 |
- feature-extraction
|
| 9 |
+
- dataset:sentence-transformers/s2orc
|
| 10 |
+
- dataset:allenai/peS2o
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
library_name: sentence-transformers
|
| 12 |
+
pipeline_tag: sentence-similarity
|
| 13 |
+
base_model: BAAI/bge-large-en-v1.5
|
| 14 |
+
language: en
|
| 15 |
+
datasets:
|
| 16 |
+
- allenai/peS2o
|
| 17 |
+
- sentence-transformers/s2orc
|
| 18 |
---
|
| 19 |
|
| 20 |
+
# bge-centralbank
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
+
**bge-centralbank** is a domain-adapted Sentence Transformer developed to assess semantic similarity in central bank-related texts. It is based on [`BAAI/bge-large-en-v1.5`](https://huggingface.co/BAAI/bge-large-en-v1.5), and adapted through both unsupervised and supervised training.
|
| 23 |
|
| 24 |
+
## Training Setup
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
|
| 26 |
+
### 1. Pretraining
|
| 27 |
|
| 28 |
+
The model was first pretrained using TSDAE on 177,842 title–abstract pairs drawn from the `peS2o` dataset — a lightweight and pre-cleaned subset of the full S2ORC corpus. From this dataset, a domain-specific corpus was constructed by filtering on keywords relevant to macroeconomics, monetary policy, and financial markets. This step enables the model to better capture the language structures and terminology common in central bank literature.
|
| 29 |
|
| 30 |
+
### 2. Supervised Fine-tuning
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 31 |
|
| 32 |
+
Fine-tuning was conducted using a subset of the `sentence-transformers/s2orc` dataset. All **title–abstract pairs** were selected where the abstract contained the term *central bank*, resulting in 15,513 positive examples (label = 1). Each pair represents a real paper, with a matching title and abstract.
|
|
|
|
| 33 |
|
| 34 |
+
An equal number of negative examples (label = 0) was generated by randomly mismatching titles and abstracts from unrelated papers.
|
|
|
|
| 35 |
|
| 36 |
+
A separate validation set of 2,000 title–abstract pairs (1,000 positive, 1,000 negative) was held out during training. The remaining 29,026 examples were used for supervised training with `CosineSimilarityLoss` using the `sentence-transformers` framework.
|
|
|
|
| 37 |
|
| 38 |
+
## Evaluation
|
|
|
|
| 39 |
|
| 40 |
+
The model was evaluated on the held-out validation set. Compared to the base model, `bge-centralbank` achieved a substantially lower average similarity score on negative pairs (from **0.6230** to **0.1177**), showing improved ability to distinguish semantically unrelated text. It also achieved a higher **point-biserial correlation** (from **0.7933** to **0.9025**), indicating better alignment with binary STS labels.
|
|
|
|
| 41 |
|
| 42 |
+
---
|
|
|